Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 200000 |
| Missing cells | 260000 |
| Missing cells (%) | 10.0% |
| Duplicate rows | 7644 |
| Duplicate rows (%) | 3.8% |
| Total size in memory | 19.8 MiB |
| Average record size in memory | 104.0 B |
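The percentages in this table follow from the raw counts by simple arithmetic. The sketch below recomputes them; the variable names are illustrative and not part of the report.

```python
# Counts copied from the "Dataset statistics" table.
n_variables = 13
n_observations = 200_000
missing_cells = 260_000
duplicate_rows = 7_644

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"total cells:    {total_cells}")
print(f"missing cells:  {missing_pct:.1f}%")
print(f"duplicate rows: {duplicate_pct:.1f}%")
```

The 10.0% missing-cell rate matches the per-variable rate of 20000 missing values out of 200000 observations reported for each of the 13 variables.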
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 2 |
| Boolean | 2 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 7644 (3.8%) duplicate rows | Duplicates |
| Height is highly overall correlated with BMI | High correlation |
| Weight is highly overall correlated with BMI | High correlation |
| BMI is highly overall correlated with Height and 1 other field | High correlation |
| Diabetes is highly imbalanced (53.1%) | Imbalance |
| Student ID has 20000 (10.0%) missing values | Missing |
| Age has 20000 (10.0%) missing values | Missing |
| Gender has 20000 (10.0%) missing values | Missing |
| Height has 20000 (10.0%) missing values | Missing |
| Weight has 20000 (10.0%) missing values | Missing |
| Blood Type has 20000 (10.0%) missing values | Missing |
| BMI has 20000 (10.0%) missing values | Missing |
| Temperature has 20000 (10.0%) missing values | Missing |
| Heart Rate has 20000 (10.0%) missing values | Missing |
| Blood Pressure has 20000 (10.0%) missing values | Missing |
| Cholesterol has 20000 (10.0%) missing values | Missing |
| Diabetes has 20000 (10.0%) missing values | Missing |
| Smoking has 20000 (10.0%) missing values | Missing |
| Student ID is uniformly distributed | Uniform |
Reproduction
| Analysis started | 2023-09-20 09:42:46.623470 |
|---|---|
| Analysis finished | 2023-09-20 09:43:03.786472 |
| Duration | 17.16 seconds |
| Software version | ydata-profiling v4.4.0 |
| Download configuration | config.json |
Student ID
Real number (ℝ)
MISSING  UNIFORM 
| Distinct | 98976 |
|---|---|
| Distinct (%) | 55.0% |
| Missing | 20000 |
| Missing (%) | 10.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49974.042 |
| Minimum | 1 |
|---|---|
| Maximum | 100000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4985.95 |
| Q1 | 24971.75 |
| median | 49943.5 |
| Q3 | 74986 |
| 95-th percentile | 94984 |
| Maximum | 100000 |
| Range | 99999 |
| Interquartile range (IQR) | 50014.25 |
Descriptive statistics
| Standard deviation | 28879.642 |
|---|---|
| Coefficient of variation (CV) | 0.57789285 |
| Kurtosis | -1.2008208 |
| Mean | 49974.042 |
| Median Absolute Deviation (MAD) | 25007 |
| Skewness | 0.0010832903 |
| Sum | 8.9953276 × 10^9 |
| Variance | 8.340337 × 10^8 |
| Monotonicity | Not monotonic |
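Several of the Student ID statistics are derived from others in the same tables. As a rough check, the sketch below recomputes the coefficient of variation, range, and IQR, and compares the reported standard deviation with the textbook value for a continuous uniform distribution over the same interval. The uniform comparison is an idealisation added here for illustration, not a computation taken from the report.

```python
import math

# Values copied from the Student ID tables above.
mean, std = 49974.042, 28879.642
minimum, maximum = 1, 100000
q1, q3 = 24971.75, 74986

cv = std / mean                   # coefficient of variation
value_range = maximum - minimum   # range
iqr = q3 - q1                     # interquartile range

# Standard deviation of a continuous uniform distribution on
# [minimum, maximum]: (b - a) / sqrt(12).
uniform_std = (maximum - minimum) / math.sqrt(12)

print(f"CV = {cv:.6f}, range = {value_range}, IQR = {iqr}")
print(f"uniform std = {uniform_std:.1f} vs reported {std:.1f}")
```

The reported standard deviation lies within about 0.1% of the idealised uniform value, which fits the Uniform alert and the kurtosis close to -1.2.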
| Value | Count | Frequency (%) |
| 54928 | 2 | < 0.1% |
| 60423 | 2 | < 0.1% |
| 60431 | 2 | < 0.1% |
| 60430 | 2 | < 0.1% |
| 60429 | 2 | < 0.1% |
| 60428 | 2 | < 0.1% |
| 60427 | 2 | < 0.1% |
| 60426 | 2 | < 0.1% |
| 60424 | 2 | < 0.1% |
| 93957 | 2 | < 0.1% |
| Other values (98966) | 179980 | |
| (Missing) | 20000 | 10.0% |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 2 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 2 | |
| 6 | 2 | |
| 7 | 2 | |
| 8 | 2 | |
| 9 | 2 | |
| 10 | 2 |
| Value | Count | Frequency (%) |
| 100000 | 2 | |
| 99999 | 2 | |
| 99998 | 2 | |
| 99997 | 1 | |
| 99995 | 2 | |
| 99994 | 1 | |
| 99993 | 2 | |
| 99992 | 2 | |
| 99991 | 2 | |
| 99990 | 2 |
Age
Real number (ℝ)
MISSING 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20000 |
| Missing (%) | 10.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.021561 |
| Minimum | 18 |
|---|---|
| Maximum | 34 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 22 |
| median | 26 |
| Q3 | 30 |
| 95-th percentile | 34 |
| Maximum | 34 |
| Range | 16 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 4.8905278 |
|---|---|
| Coefficient of variation (CV) | 0.18794137 |
| Kurtosis | -1.2038811 |
| Mean | 26.021561 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.0033658352 |
| Sum | 4683881 |
| Variance | 23.917262 |
| Monotonicity | Not monotonic |
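Age takes 17 integer values (18 to 34) with nearly equal counts, so its moments can be compared against an exactly uniform distribution on those integers. The sketch below computes that idealised reference; it assumes perfectly equal counts, which the data only approximates.

```python
# Idealised reference: every age from 18 to 34 equally likely.
values = list(range(18, 35))
n = len(values)

mean = sum(values) / n
variance = sum((v - mean) ** 2 for v in values) / n
fourth_moment = sum((v - mean) ** 4 for v in values) / n
excess_kurtosis = fourth_moment / variance ** 2 - 3

print(n, mean, variance, round(excess_kurtosis, 4))
```

The reported mean (26.02), variance (23.92), and kurtosis (-1.204) sit close to these reference values.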
| Value | Count | Frequency (%) |
| 28 | 10882 | 5.4% |
| 27 | 10755 | 5.4% |
| 33 | 10703 | 5.4% |
| 22 | 10691 | 5.3% |
| 25 | 10683 | 5.3% |
| 21 | 10677 | 5.3% |
| 29 | 10676 | 5.3% |
| 34 | 10660 | 5.3% |
| 24 | 10600 | 5.3% |
| 20 | 10566 | 5.3% |
| Other values (7) | 73107 | |
| (Missing) | 20000 | 10.0% |
| Value | Count | Frequency (%) |
| 18 | 10383 | |
| 19 | 10413 | |
| 20 | 10566 | |
| 21 | 10677 | |
| 22 | 10691 | |
| 23 | 10335 | |
| 24 | 10600 | |
| 25 | 10683 | |
| 26 | 10486 | |
| 27 | 10755 |
| Value | Count | Frequency (%) |
| 34 | 10660 | |
| 33 | 10703 | |
| 32 | 10510 | |
| 31 | 10541 | |
| 30 | 10439 | |
| 29 | 10676 | |
| 28 | 10882 | |
| 27 | 10755 | |
| 26 | 10486 | |
| 25 | 10683 |
Gender
Categorical
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20000 |
| Missing (%) | 10.0% |
| Memory size | 1.5 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.9999444 |
| Min length | 4 |
Characters and Unicode
| Total characters | 899990 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
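The length and character totals for Gender can be recovered from the category counts alone. A minimal sketch, using the counts from the Gender "Common Values" table (the dictionary name is illustrative):

```python
# Non-missing category counts for Gender.
counts = {"Male": 90005, "Female": 89995}

total_values = sum(counts.values())
# Each label contributes len(label) characters per occurrence.
total_chars = sum(len(label) * n for label, n in counts.items())
mean_length = total_chars / total_values

print(total_values, total_chars, round(mean_length, 7))
```

Because the two categories are almost equally frequent, the mean length is just below 5, the midpoint of 4 ("Male") and 6 ("Female").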
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Male |
| 3rd row | Female |
| 4th row | Male |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
| Male | 90005 | |
| Female | 89995 | |
| (Missing) | 20000 | 10.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 269995 | |
| a | 180000 | |
| l | 180000 | |
| M | 90005 | 10.0% |
| F | 89995 | 10.0% |
| m | 89995 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 719990 | |
| Uppercase Letter | 180000 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 269995 | |
| a | 180000 | |
| l | 180000 | |
| m | 89995 | 12.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 90005 | |
| F | 89995 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 899990 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 269995 | |
| a | 180000 | |
| l | 180000 | |
| M | 90005 | 10.0% |
| F | 89995 | 10.0% |
| m | 89995 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 899990 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 269995 | |
| a | 180000 | |
| l | 180000 | |
| M | 90005 | 10.0% |
| F | 89995 | 10.0% |
| m | 89995 | 10.0% |
Height
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 98992 |
|---|---|
| Distinct (%) | 55.0% |
| Missing | 20000 |
| Missing (%) | 10.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 174.9471 |
| Minimum | 150.00004 |
|---|---|
| Maximum | 199.99864 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 150.00004 |
|---|---|
| 5-th percentile | 152.4158 |
| Q1 | 162.47611 |
| median | 174.89991 |
| Q3 | 187.46442 |
| 95-th percentile | 197.45481 |
| Maximum | 199.99864 |
| Range | 49.998597 |
| Interquartile range (IQR) | 24.988307 |
Descriptive statistics
| Standard deviation | 14.44756 |
|---|---|
| Coefficient of variation (CV) | 0.082582446 |
| Kurtosis | -1.2008886 |
| Mean | 174.9471 |
| Median Absolute Deviation (MAD) | 12.506777 |
| Skewness | 0.0022729101 |
| Sum | 31490478 |
| Variance | 208.73198 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 161.7779242 | 2 | < 0.1% |
| 185.0364512 | 2 | < 0.1% |
| 190.8724165 | 2 | < 0.1% |
| 154.9205933 | 2 | < 0.1% |
| 171.1549261 | 2 | < 0.1% |
| 196.1785543 | 2 | < 0.1% |
| 181.9949274 | 2 | < 0.1% |
| 198.5948733 | 2 | < 0.1% |
| 155.4997801 | 2 | < 0.1% |
| 174.7993486 | 2 | < 0.1% |
| Other values (98982) | 179980 | |
| (Missing) | 20000 | 10.0% |
| Value | Count | Frequency (%) |
| 150.0000414 | 1 | |
| 150.0003289 | 2 | |
| 150.0008757 | 1 | |
| 150.0009957 | 2 | |
| 150.0021254 | 2 | |
| 150.004363 | 2 | |
| 150.0043832 | 2 | |
| 150.0053413 | 2 | |
| 150.0070718 | 2 | |
| 150.0070791 | 2 |
| Value | Count | Frequency (%) |
| 199.9986387 | 1 | |
| 199.9979397 | 2 | |
| 199.9969655 | 1 | |
| 199.9968342 | 2 | |
| 199.9967708 | 2 | |
| 199.9954615 | 1 | |
| 199.99526 | 2 | |
| 199.9946035 | 2 | |
| 199.9935183 | 2 | |
| 199.993175 | 2 |
Weight
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 99026 |
|---|---|
| Distinct (%) | 55.0% |
| Missing | 20000 |
| Missing (%) | 10.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 69.971585 |
| Minimum | 40.000578 |
|---|---|
| Maximum | 99.999907 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 40.000578 |
|---|---|
| 5-th percentile | 42.950006 |
| Q1 | 54.969838 |
| median | 69.979384 |
| Q3 | 84.980097 |
| 95-th percentile | 97.000203 |
| Maximum | 99.999907 |
| Range | 59.999329 |
| Interquartile range (IQR) | 30.01026 |
Descriptive statistics
| Standard deviation | 17.322574 |
|---|---|
| Coefficient of variation (CV) | 0.24756584 |
| Kurtosis | -1.2009818 |
| Mean | 69.971585 |
| Median Absolute Deviation (MAD) | 15.006275 |
| Skewness | 0.0048199774 |
| Sum | 12594885 |
| Variance | 300.07158 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 72.35494708 | 2 | < 0.1% |
| 89.94904672 | 2 | < 0.1% |
| 91.20816557 | 2 | < 0.1% |
| 62.32576927 | 2 | < 0.1% |
| 62.57737222 | 2 | < 0.1% |
| 57.13374418 | 2 | < 0.1% |
| 47.55671987 | 2 | < 0.1% |
| 51.31386665 | 2 | < 0.1% |
| 49.63270021 | 2 | < 0.1% |
| 84.1466654 | 2 | < 0.1% |
| Other values (99016) | 179980 | |
| (Missing) | 20000 | 10.0% |
| Value | Count | Frequency (%) |
| 40.00057777 | 2 | |
| 40.00071806 | 2 | |
| 40.00167958 | 2 | |
| 40.00211019 | 2 | |
| 40.00278931 | 2 | |
| 40.0033142 | 2 | |
| 40.0043693 | 2 | |
| 40.00469245 | 2 | |
| 40.00501204 | 2 | |
| 40.00506022 | 2 |
| Value | Count | Frequency (%) |
| 99.99990661 | 1 | |
| 99.99945951 | 2 | |
| 99.99872464 | 2 | |
| 99.99766773 | 2 | |
| 99.99710962 | 2 | |
| 99.99708677 | 1 | |
| 99.99698818 | 2 | |
| 99.9958793 | 2 | |
| 99.99574444 | 2 | |
| 99.9955893 | 2 |
Blood Type
Categorical
MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20000 |
| Missing (%) | 10.0% |
| Memory size | 1.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.2471444 |
| Min length | 1 |
Characters and Unicode
| Total characters | 224486 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | O |
|---|---|
| 2nd row | B |
| 3rd row | A |
| 4th row | B |
| 5th row | O |
Common Values
| Value | Count | Frequency (%) |
| B | 45537 | |
| O | 45511 | |
| AB | 44486 | |
| A | 44466 | |
| (Missing) | 20000 | 10.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 90023 | |
| A | 88952 | |
| O | 45511 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 224486 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 90023 | |
| A | 88952 | |
| O | 45511 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 224486 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 90023 | |
| A | 88952 | |
| O | 45511 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 224486 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 90023 | |
| A | 88952 | |
| O | 45511 |
BMI
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 98983 |
|---|---|
| Distinct (%) | 55.0% |
| Missing | 20000 |
| Missing (%) | 10.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.338869 |
| Minimum | 10.074837 |
|---|---|
| Maximum | 44.355113 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 10.074837 |
|---|---|
| 5-th percentile | 13.077572 |
| Q1 | 17.858396 |
| median | 22.671401 |
| Q3 | 27.997487 |
| 95-th percentile | 36.309183 |
| Maximum | 44.355113 |
| Range | 34.280276 |
| Interquartile range (IQR) | 10.139092 |
Descriptive statistics
| Standard deviation | 7.0335537 |
|---|---|
| Coefficient of variation (CV) | 0.30136651 |
| Kurtosis | -0.42473137 |
| Mean | 23.338869 |
| Median Absolute Deviation (MAD) | 5.0210732 |
| Skewness | 0.43835089 |
| Sum | 4200996.5 |
| Variance | 49.470878 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 27.64583507 | 2 | < 0.1% |
| 21.31288797 | 2 | < 0.1% |
| 17.88748454 | 2 | < 0.1% |
| 17.1649642 | 2 | < 0.1% |
| 24.57522201 | 2 | < 0.1% |
| 17.08301526 | 2 | < 0.1% |
| 14.97046433 | 2 | < 0.1% |
| 20.59578432 | 2 | < 0.1% |
| 35.26719381 | 2 | < 0.1% |
| 28.65053448 | 2 | < 0.1% |
| Other values (98973) | 179980 | |
| (Missing) | 20000 | 10.0% |
| Value | Count | Frequency (%) |
| 10.07483709 | 2 | |
| 10.0814309 | 2 | |
| 10.0901314 | 2 | |
| 10.10207858 | 2 | |
| 10.11272071 | 2 | |
| 10.12641305 | 2 | |
| 10.12673379 | 1 | |
| 10.13625472 | 2 | |
| 10.13677254 | 2 | |
| 10.13936456 | 2 |
| Value | Count | Frequency (%) |
| 44.3551126 | 1 | |
| 44.31407361 | 2 | |
| 44.28800321 | 2 | |
| 44.19402058 | 2 | |
| 44.17538675 | 1 | |
| 44.15754669 | 2 | |
| 44.09093573 | 2 | |
| 44.07686164 | 2 | |
| 44.07419486 | 2 | |
| 44.0514756 | 2 |
Temperature
Real number (ℝ)
MISSING 
| Distinct | 99006 |
|---|---|
| Distinct (%) | 55.0% |
| Missing | 20000 |
| Missing (%) | 10.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98.600948 |
| Minimum | 96.397835 |
|---|---|
| Maximum | 100.82486 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 96.397835 |
|---|---|
| 5-th percentile | 97.778561 |
| Q1 | 98.26475 |
| median | 98.599654 |
| Q3 | 98.940543 |
| 95-th percentile | 99.426585 |
| Maximum | 100.82486 |
| Range | 4.4270217 |
| Interquartile range (IQR) | 0.67579279 |
Descriptive statistics
| Standard deviation | 0.50053017 |
|---|---|
| Coefficient of variation (CV) | 0.0050763221 |
| Kurtosis | 0.0069486449 |
| Mean | 98.600948 |
| Median Absolute Deviation (MAD) | 0.33804288 |
| Skewness | 0.010467187 |
| Sum | 17748171 |
| Variance | 0.25053045 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 98.89844658 | 2 | < 0.1% |
| 98.28320066 | 2 | < 0.1% |
| 98.058989 | 2 | < 0.1% |
| 98.08941353 | 2 | < 0.1% |
| 98.74886556 | 2 | < 0.1% |
| 98.16848078 | 2 | < 0.1% |
| 98.60092163 | 2 | < 0.1% |
| 97.99633484 | 2 | < 0.1% |
| 98.51004203 | 2 | < 0.1% |
| 98.58211416 | 2 | < 0.1% |
| Other values (98996) | 179980 | |
| (Missing) | 20000 | 10.0% |
| Value | Count | Frequency (%) |
| 96.3978355 | 2 | |
| 96.59628951 | 2 | |
| 96.6095813 | 1 | |
| 96.64100649 | 2 | |
| 96.69389468 | 2 | |
| 96.75459508 | 2 | |
| 96.75597703 | 2 | |
| 96.76091358 | 2 | |
| 96.78533653 | 2 | |
| 96.81438808 | 2 |
| Value | Count | Frequency (%) |
| 100.8248572 | 2 | |
| 100.7737647 | 2 | |
| 100.7456864 | 2 | |
| 100.6533907 | 2 | |
| 100.6128156 | 1 | |
| 100.5880979 | 2 | |
| 100.5874788 | 2 | |
| 100.5761755 | 2 | |
| 100.5664977 | 2 | |
| 100.5348293 | 2 |
Heart Rate
Real number (ℝ)
MISSING 
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20000 |
| Missing (%) | 10.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79.503767 |
| Minimum | 60 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 60 |
|---|---|
| 5-th percentile | 61 |
| Q1 | 70 |
| median | 80 |
| Q3 | 90 |
| 95-th percentile | 97 |
| Maximum | 99 |
| Range | 39 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 11.540755 |
|---|---|
| Coefficient of variation (CV) | 0.14515985 |
| Kurtosis | -1.1981236 |
| Mean | 79.503767 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | -0.0010632541 |
| Sum | 14310678 |
| Variance | 133.18903 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 77 | 4679 | 2.3% |
| 97 | 4613 | 2.3% |
| 63 | 4603 | 2.3% |
| 92 | 4597 | 2.3% |
| 73 | 4588 | 2.3% |
| 70 | 4579 | 2.3% |
| 74 | 4576 | 2.3% |
| 71 | 4573 | 2.3% |
| 61 | 4571 | 2.3% |
| 88 | 4561 | 2.3% |
| Other values (30) | 134060 | |
| (Missing) | 20000 | 10.0% |
| Value | Count | Frequency (%) |
| 60 | 4517 | |
| 61 | 4571 | |
| 62 | 4453 | |
| 63 | 4603 | |
| 64 | 4544 | |
| 65 | 4425 | |
| 66 | 4400 | |
| 67 | 4269 | |
| 68 | 4560 | |
| 69 | 4317 |
| Value | Count | Frequency (%) |
| 99 | 4478 | |
| 98 | 4513 | |
| 97 | 4613 | |
| 96 | 4430 | |
| 95 | 4451 | |
| 94 | 4471 | |
| 93 | 4492 | |
| 92 | 4597 | |
| 91 | 4506 | |
| 90 | 4504 |
Blood Pressure
Real number (ℝ)
MISSING 
| Distinct | 50 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20000 |
| Missing (%) | 10.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 114.55803 |
| Minimum | 90 |
|---|---|
| Maximum | 139 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 90 |
|---|---|
| 5-th percentile | 92 |
| Q1 | 102 |
| median | 115 |
| Q3 | 127 |
| 95-th percentile | 137 |
| Maximum | 139 |
| Range | 49 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 14.403353 |
|---|---|
| Coefficient of variation (CV) | 0.12572975 |
| Kurtosis | -1.1954515 |
| Mean | 114.55803 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.0048300908 |
| Sum | 20620446 |
| Variance | 207.45658 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 106 | 3823 | 1.9% |
| 117 | 3741 | 1.9% |
| 109 | 3724 | 1.9% |
| 135 | 3714 | 1.9% |
| 97 | 3709 | 1.9% |
| 91 | 3703 | 1.9% |
| 131 | 3697 | 1.8% |
| 136 | 3697 | 1.8% |
| 116 | 3696 | 1.8% |
| 128 | 3682 | 1.8% |
| Other values (40) | 142814 | |
| (Missing) | 20000 | 10.0% |
| Value | Count | Frequency (%) |
| 90 | 3520 | |
| 91 | 3703 | |
| 92 | 3539 | |
| 93 | 3507 | |
| 94 | 3551 | |
| 95 | 3451 | |
| 96 | 3586 | |
| 97 | 3709 | |
| 98 | 3531 | |
| 99 | 3552 |
| Value | Count | Frequency (%) |
| 139 | 3584 | |
| 138 | 3630 | |
| 137 | 3431 | |
| 136 | 3697 | |
| 135 | 3714 | |
| 134 | 3601 | |
| 133 | 3599 | |
| 132 | 3566 | |
| 131 | 3697 | |
| 130 | 3580 |
Cholesterol
Real number (ℝ)
MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 20000 |
| Missing (%) | 10.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 184.48636 |
| Minimum | 120 |
|---|---|
| Maximum | 249 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 120 |
|---|---|
| 5-th percentile | 126 |
| Q1 | 152 |
| median | 184 |
| Q3 | 217 |
| 95-th percentile | 243 |
| Maximum | 249 |
| Range | 129 |
| Interquartile range (IQR) | 65 |
Descriptive statistics
| Standard deviation | 37.559678 |
|---|---|
| Coefficient of variation (CV) | 0.20359054 |
| Kurtosis | -1.2024382 |
| Mean | 184.48636 |
| Median Absolute Deviation (MAD) | 32 |
| Skewness | 0.0041550445 |
| Sum | 33207545 |
| Variance | 1410.7294 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 223 | 1534 | 0.8% |
| 155 | 1501 | 0.8% |
| 211 | 1478 | 0.7% |
| 215 | 1476 | 0.7% |
| 249 | 1473 | 0.7% |
| 157 | 1457 | 0.7% |
| 245 | 1455 | 0.7% |
| 127 | 1453 | 0.7% |
| 150 | 1451 | 0.7% |
| 161 | 1450 | 0.7% |
| Other values (120) | 165272 | |
| (Missing) | 20000 | 10.0% |
| Value | Count | Frequency (%) |
| 120 | 1400 | |
| 121 | 1441 | |
| 122 | 1373 | |
| 123 | 1421 | |
| 124 | 1310 | |
| 125 | 1365 | |
| 126 | 1384 | |
| 127 | 1453 | |
| 128 | 1328 | |
| 129 | 1377 |
| Value | Count | Frequency (%) |
| 249 | 1473 | |
| 248 | 1343 | |
| 247 | 1317 | |
| 246 | 1358 | |
| 245 | 1455 | |
| 244 | 1406 | |
| 243 | 1422 | |
| 242 | 1402 | |
| 241 | 1387 | |
| 240 | 1401 |
Diabetes
Boolean
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20000 |
| Missing (%) | 10.0% |
| Memory size | 390.8 KiB |
| Value | Count | Frequency (%) |
| False | 161986 | |
| True | 18014 | 9.0% |
| (Missing) | 20000 | 10.0% |
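The 53.1% figure in the imbalance alert is consistent with a score of one minus the normalised Shannon entropy of the non-missing class counts. That this is the formula the tool applies is an assumption inferred from the numbers, not something the report states. A minimal sketch:

```python
import math

# Non-missing class counts copied from the Diabetes table (False, True).
counts = [161986, 18014]

total = sum(counts)
probs = [c / total for c in counts]

# Shannon entropy in bits, divided by the maximum possible entropy
# log2(k) for k classes, so the ratio lies between 0 and 1.
entropy = -sum(p * math.log2(p) for p in probs if p > 0)
imbalance = 1 - entropy / math.log2(len(counts))

print(f"imbalance score: {100 * imbalance:.1f}%")
```

Under this definition a perfectly balanced variable scores 0% and a variable with a single class scores 100%.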
Smoking
Boolean
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20000 |
| Missing (%) | 10.0% |
| Memory size | 390.8 KiB |
| Value | Count | Frequency (%) |
| False | 143971 | |
| True | 36029 | 18.0% |
| (Missing) | 20000 | 10.0% |
Correlations
| | Student ID | Age | Height | Weight | BMI | Temperature | Heart Rate | Blood Pressure | Cholesterol | Gender | Blood Type | Diabetes | Smoking |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Student ID | 1.000 | 0.000 | 0.003 | -0.001 | -0.003 | 0.004 | 0.004 | 0.002 | -0.003 | 0.011 | 0.007 | 0.009 | 0.003 |
| Age | 0.000 | 1.000 | 0.002 | 0.005 | 0.003 | 0.003 | 0.005 | -0.004 | -0.001 | 0.009 | 0.012 | 0.005 | 0.010 |
| Height | 0.003 | 0.002 | 1.000 | -0.000 | -0.524 | -0.005 | 0.006 | 0.004 | 0.002 | 0.008 | 0.007 | 0.011 | 0.006 |
| Weight | -0.001 | 0.005 | -0.000 | 1.000 | 0.841 | 0.000 | -0.000 | 0.002 | -0.002 | 0.006 | 0.006 | 0.006 | 0.007 |
| BMI | -0.003 | 0.003 | -0.524 | 0.841 | 1.000 | 0.001 | -0.003 | -0.001 | -0.003 | 0.004 | 0.006 | 0.008 | 0.005 |
| Temperature | 0.004 | 0.003 | -0.005 | 0.000 | 0.001 | 1.000 | -0.005 | -0.004 | -0.000 | 0.000 | 0.007 | 0.006 | 0.005 |
| Heart Rate | 0.004 | 0.005 | 0.006 | -0.000 | -0.003 | -0.005 | 1.000 | 0.003 | 0.005 | 0.009 | 0.005 | 0.012 | 0.003 |
| Blood Pressure | 0.002 | -0.004 | 0.004 | 0.002 | -0.001 | -0.004 | 0.003 | 1.000 | 0.003 | 0.009 | 0.003 | 0.005 | 0.005 |
| Cholesterol | -0.003 | -0.001 | 0.002 | -0.002 | -0.003 | -0.000 | 0.005 | 0.003 | 1.000 | 0.004 | 0.006 | 0.005 | 0.003 |
| Gender | 0.011 | 0.009 | 0.008 | 0.006 | 0.004 | 0.000 | 0.009 | 0.009 | 0.004 | 1.000 | 0.006 | 0.002 | 0.002 |
| Blood Type | 0.007 | 0.012 | 0.007 | 0.006 | 0.006 | 0.007 | 0.005 | 0.003 | 0.006 | 0.006 | 1.000 | 0.000 | 0.010 |
| Diabetes | 0.009 | 0.005 | 0.011 | 0.006 | 0.008 | 0.006 | 0.012 | 0.005 | 0.005 | 0.002 | 0.000 | 1.000 | 0.000 |
| Smoking | 0.003 | 0.010 | 0.006 | 0.007 | 0.005 | 0.005 | 0.003 | 0.005 | 0.003 | 0.002 | 0.010 | 0.000 | 1.000 |
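The only sizeable coefficients in the matrix link BMI to Weight (0.841) and Height (-0.524), which is what one would expect if BMI is computed from those two columns. The sketch below checks this against three complete rows from the sample of first rows, assuming Height is in centimetres and Weight in kilograms (the report does not state units).

```python
# (height_cm, weight_kg, reported_bmi) copied from sample rows 0, 2 and 3.
rows = [
    (161.777924, 72.354947, 27.645835),
    (182.537664, 55.741083, 16.729017),
    (182.112867, 63.332207, 19.096042),
]


def bmi(height_cm: float, weight_kg: float) -> float:
    """Body mass index: weight in kg divided by squared height in metres."""
    return weight_kg / (height_cm / 100) ** 2


for height, weight, reported in rows:
    print(f"computed {bmi(height, weight):.4f}  reported {reported:.4f}")
```

The computed and reported values agree to about four decimal places for these rows. Height and Weight themselves are essentially uncorrelated (-0.000).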
First rows
| | Student ID | Age | Gender | Height | Weight | Blood Type | BMI | Temperature | Heart Rate | Blood Pressure | Cholesterol | Diabetes | Smoking |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1.0 | 18.0 | Female | 161.777924 | 72.354947 | O | 27.645835 | NaN | 95.0 | 109.0 | 203.0 | No | NaN |
| 1 | 2.0 | NaN | Male | 152.069157 | 47.630941 | B | NaN | 98.714977 | 93.0 | 104.0 | 163.0 | No | No |
| 2 | 3.0 | 32.0 | Female | 182.537664 | 55.741083 | A | 16.729017 | 98.260293 | 76.0 | 130.0 | 216.0 | Yes | No |
| 3 | NaN | 30.0 | Male | 182.112867 | 63.332207 | B | 19.096042 | 98.839605 | 99.0 | 112.0 | 141.0 | No | Yes |
| 4 | 5.0 | 23.0 | Female | NaN | 46.234173 | O | NaN | 98.480008 | 95.0 | NaN | 231.0 | No | No |
| 5 | 6.0 | 32.0 | NaN | 151.491294 | 68.647805 | B | 29.912403 | 99.668373 | 70.0 | 128.0 | 183.0 | NaN | Yes |
| 6 | 7.0 | 21.0 | NaN | 172.949704 | 48.102744 | AB | 16.081635 | 97.715469 | 66.0 | 134.0 | 247.0 | No | No |
| 7 | 8.0 | 28.0 | Male | 186.489402 | 52.389752 | AB | 15.063921 | 98.227788 | 85.0 | 123.0 | 128.0 | No | No |
| 8 | 9.0 | 21.0 | Male | 155.039678 | 42.958703 | B | NaN | 98.808053 | NaN | 111.0 | 243.0 | No | No |
| 9 | 10.0 | 32.0 | NaN | 170.836315 | 50.783250 | B | 17.400435 | 98.570168 | 61.0 | 94.0 | 166.0 | NaN | No |
Last rows
| | Student ID | Age | Gender | Height | Weight | Blood Type | BMI | Temperature | Heart Rate | Blood Pressure | Cholesterol | Diabetes | Smoking |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 199990 | 99991.0 | 21.0 | Female | 183.735110 | 51.172076 | AB | 15.158238 | 97.998790 | 67.0 | 96.0 | 249.0 | NaN | Yes |
| 199991 | 99992.0 | 28.0 | Male | 183.499177 | NaN | A | 26.527962 | 97.321680 | 70.0 | 113.0 | 140.0 | No | No |
| 199992 | 99993.0 | 34.0 | Male | 161.590030 | 90.877589 | B | 34.803881 | 98.728836 | 70.0 | 96.0 | 208.0 | No | No |
| 199993 | 99994.0 | 22.0 | Male | NaN | 46.155224 | A | NaN | 98.331019 | 93.0 | 100.0 | NaN | Yes | No |
| 199994 | 99995.0 | 22.0 | Male | 159.486907 | NaN | A | 27.631082 | 98.971976 | 86.0 | 134.0 | 208.0 | No | NaN |
| 199995 | NaN | 24.0 | Male | 176.503260 | 95.756997 | B | 30.737254 | 99.170685 | 65.0 | 121.0 | 130.0 | No | No |
| 199996 | 99997.0 | 29.0 | Female | 163.917675 | 45.225194 | NaN | 16.831734 | 97.865785 | 62.0 | 125.0 | 198.0 | No | Yes |
| 199997 | 99998.0 | 34.0 | Female | NaN | 99.648914 | NaN | 33.189303 | 98.768210 | 60.0 | 90.0 | 154.0 | NaN | No |
| 199998 | 99999.0 | 30.0 | Female | 156.446944 | 50.142824 | A | 20.486823 | 98.994212 | 61.0 | 106.0 | 225.0 | No | No |
| 199999 | 100000.0 | 20.0 | Female | 153.927409 | 99.928405 | O | 42.175189 | 98.595817 | 95.0 | 133.0 | 132.0 | NaN | No |
Most frequently occurring
| | Student ID | Age | Gender | Height | Weight | Blood Type | BMI | Temperature | Heart Rate | Blood Pressure | Cholesterol | Diabetes | Smoking | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 8.0 | 28.0 | Male | 186.489402 | 52.389752 | AB | 15.063921 | 98.227788 | 85.0 | 123.0 | 128.0 | No | No | 2 |
| 1 | 12.0 | 34.0 | Female | 182.416302 | 76.371050 | AB | 22.950992 | 98.118274 | 86.0 | 97.0 | 247.0 | No | No | 2 |
| 2 | 19.0 | 31.0 | Female | 158.790160 | 46.829849 | AB | 18.572723 | 98.784709 | 92.0 | 102.0 | 172.0 | NaN | No | 2 |
| 3 | 23.0 | 29.0 | Female | 179.909041 | 90.679436 | AB | 28.015787 | 98.782269 | 81.0 | 108.0 | 227.0 | No | Yes | 2 |
| 4 | 24.0 | 18.0 | Male | NaN | 52.521560 | AB | 13.570402 | 98.215090 | 60.0 | 132.0 | 217.0 | No | No | 2 |
| 5 | 25.0 | 27.0 | Female | 187.411623 | 81.219470 | AB | 23.124221 | 97.738939 | 99.0 | 135.0 | 123.0 | No | No | 2 |
| 6 | 36.0 | 21.0 | Male | 183.476287 | 61.469995 | O | 18.260106 | 99.346920 | 62.0 | 127.0 | 233.0 | No | No | 2 |
| 7 | 52.0 | 23.0 | Male | 174.338438 | 45.421333 | AB | 14.944231 | 98.672024 | 89.0 | 101.0 | 236.0 | No | No | 2 |
| 8 | 79.0 | 27.0 | Female | 187.852506 | 82.352369 | B | 23.336843 | 98.300921 | 78.0 | 99.0 | 206.0 | No | No | 2 |
| 9 | 87.0 | 34.0 | Female | 150.942632 | 90.580214 | O | 39.756624 | 97.563234 | 79.0 | 135.0 | 198.0 | Yes | No | 2 |